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Integration  of  Complex  Information 

Douglas  S.  Lange 
United  States  of  America 
Space  and  Naval  Warfare  Systems  Center 
53560  Hull  Street 
San  Diego,  CA  92152-5001 

Tel:  +1(619)  553-6534,  e-mail:  dlange@spawar.naw.mil 

Abstract:-  An  intelligence  support  system  has  been  developed  using  an  open  hypermedia 
architecture.  This  approach  provides  a  means  to  integrate  information  from  distributed 
disparate  sources  into  a  knowledge  base.  Access  by  external  applications  is  supported  through 
a  public  interface.  Filtering  and  change  detection  functions  have  also  been  implemented.  The 
approach  has  shown  promise  in  multiple  domains  indicating  that  it  may  be  widely  applicable. 

The  integration  of  systems  on  a  network  was  accomplished  early  in  this  decade.  The  result 
has  been  the  creation  of  information  stovepipes  all  capable  of  being  executed  from  a  common 
platform.  It  is  left  to  the  system’s  user  to  make  a  mental  picture  of  the  current  situation  by 
comparing  the  results  displayed  from  several  applications.  Using  open  hypermedia  technology, 
information  from  a  variety  of  distributed  applications  and  databases  can  be  associated  without 
modifying  the  original  data.  In  this  way,  information  integration  is  achieved,  bringing  diverse 
information  elements  together  to  form  a  semantic  network  available  to  humans  and 
autonomous  applications.  Using  this  approach,  analysts  can  model  situations  more  robustly 
than  previously  accomplished  with  relational  databases.  Consumers  of  intelligence  products 
are  able  to  use  a  filtered  perspective  of  the  model  through  hypermedia  functions  and  launch 
autonomous  agents  to  traverse  the  semantic  network  finding  particular  insights.  Supported  by 
an  open  architecture  and  public  interfaces,  the  model  is  available  to  external  applications 
including  geographic,  temporal,  relational,  and  browsing  displays.  Further,  hypermedia 
research  provides  a  variety  of  user  interaction  models  that  can  be  implemented  as  external 
applications. 

Introduction 

Command  and  control  involves  three  fundamental  processes  that  fit  together  in  a  tight  cycle. 
Situation  analysis  provides  the  context  on  which  to  act.  Decisions  are  made  based  on  the 
analysis  results.  These  decisions  constitute  planned  movements,  engagement  orders,  and  many 
other  possible  actions.  Decisions  must  be  communicated  to  those  who  are  to  carry  out  the 
actions.  The  results  of  these  actions  are  observed  and  folded  into  a  new  situation  analysis. 

As  Command,  Control,  Communications,  Computers,  Intelligence,  Surveillance,  and 
Reconnaissance  (C4ISR)  systems  have  evolved,  system  integration  has  been  the  general  theme. 
Stand-alone  systems,  each  with  its  own  database,  were  first  interfaced  to  allow  some  transfer 
of  data.  Data  management  schemes  were  implemented  to  provide  some  consistency  among 
databases  and  operational  units.  System  federation  gradually  allowed  multiple  applications  to 
run  on  users’  workstations,  preventing  the  need  for  specialised  hardware  and  support  software 
for  large  numbers  of  individual  systems.  The  current  state  of  system  integration  allows  not 
only  allows  multiple  applications  to  share  hardware,  operating  system,  and  network  platforms, 
but  also  utilises  a  layered  service  architecture  to  eliminate  redundancy  of  some  capabilities. 
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The  evolution  of  system  integration  has  broadened  the  stovepipes  that  were  so  narrow  in 
previous  system  generations.  The  resulting  view  is  of  a  few  very  broad  systems  made  up  of 
many  small  applications,  any  of  which,  may  be  accessible  through  the  workstation  in  front  of 
the  user.  Some  applications  work  on  common  data  managed  through  centralised  services. 
However,  many  categories  of  data  still  form  separate  stovepipes  as  they  are  maintained  in 
separate  data  servers  due  their  differing  technical  natures  and  programmatic  backgrounds.  It  is 
left  to  mental  effort  by  the  users  to  associate  the  tactical  situation  shown  in  one  application 
with  the  results  of  a  logistical  query  conducted  through  another. 

Information  Complexity 

The  focus  on  systems  integration  ignores  the  true  goal  in  decision  support.  It  is  the 
information  that  is  of  ultimate  value  to  the  decision-makers.  Integrating  the  information  is  the 
next  step.  However,  military  information  is  not  a  simple  matter  of  collecting  and  crunching 
sales  and  inventory  figures  from  various  branch  offices  as  found  in  data  warehousing 
applications.  The  domain  of  the  military  environment  is  complex.  The  variety  of  concepts, 
events,  and  situations  that  can  be  described  subjectively  or  measured  and  reported  objectively 
is  probably  limitless.  No  ontological  study  can  a  priori  determine  all  of  the  possible  data  types 
needed  to  describe  the  military  environment.  Therefore,  information  integration  is  not  going  to 
be  completely  accomplished  through  bringing  all  data  into  a  relational  or  object  database. 

A  Pattern  of  Analysis 

In  researching  the  requirements  for  an  intelligence  support  system  for  the  United  States 
Defense  Intelligence  Agency  (DIA),  a  pattern  of  analysis  was  uncovered  that  was  common  to 
those  used  in  some  other  domains.  The  primary  feature  of  this  pattern  is  that  an  analyst’s  role 
is  to  create  associations  among  existing  data.  An  analyst  rarely  creates  data,  but  searches, 
filters,  reviews  all  available  information.  As  they  do,  they  form  networks  of  related  information 
[Lange,  1999] 

Current  practice  involves  DIA  intelligence  analysts  spending  a  portion  of  their  time  building  up 
a  private  model  of  their  area  of  expertise.  The  remainder  of  their  time  is  spent  responding  to 
queries  from  DIA’s  various  customers.  The  responses  typically  take  the  form  of  linear  essays. 
Analysts  also  periodically  produce  background  reports  on  particular  matters  of  interest.  These 
too  take  a  strictly  linear  book-like  form  even  when  delivered  over  a  computer  network. 

The  results  of  the  current  approach  included  the  following  problems: 

•  The  products  were  static  or  updated  on  a  paper  publishing  timeframe 

•  Inability  for  customers  with  local  information  to  share  with  others 

•  Only  a  particular  question  gets  answered,  even  if  it  was  not  the  correct  question 

•  Analyst  turnover  causes  a  large  loss  of  knowledge. 

As  a  result  of  these  insights,  work  was  initiated  to  find  a  way  of  recording  the  knowledge 
being  built  by  the  intelligence  analyst  and  communicating  this  knowledge  to  intelligence 
consumers.  The  goal  was  to  move  away  from  the  linear  essay  and  its  strict  segregation  of 
reader  and  writer  roles  to  a  more  collaborative  method  of  communications  that  would  allow 
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for  continuous  update  of  the  knowledge  jointly  held  between  the  intelligence  agency  and  its 
customers. 

Recording  Decisions 

Decisions  also  take  the  form  of  associations  among  data  or  information  elements.  A  classic 
example  may  be  the  order  for  a  surface  combatant  to  engage  a  hostile  aircraft.  The  decision¬ 
maker  did  not  create  the  aircraft  or  the  positional  and  attribute  data  held  on  that  aircraft. 
Likewise,  the  surface  combatant’s  information  was  not  generated  by  the  decision-maker.  The 
value  added  by  the  decision-maker  is  that  an  engagement  relationship  (perhaps  with  other 
amplifying  information)  should  exist  between  the  two. 

As  the  data  on  the  two  combatants  changes,  the  association  must  be  reviewed  but  is  not 
necessarily  invalidated.  Likewise,  a  reversal  of  the  decision  changes  the  relationship  among  the 
combatants  but  does  not  change  any  of  their  individual  data.  This  fundamental  distinction 
between  the  structural  representation  of  the  associations  among  concepts  or  real-world  objects 
and  the  content  that  describes  them  is  common  between  the  knowledge  created  by  analysts 
and  decision-makers. 

Hypermedia  Architectures 

One  mechanism  for  automating  the  management  of  information  that  is  structured  in  the 
manner  described  above  is  through  hypermedia  systems.  Such  systems  provide  an  ability  to 
work  with  a  wide  variety  of  data  while  utilising  the  powerful  information  present  through  the 
structures  created  by  the  connections  made  among  the  various  data  items  [Numberg,  et.  al., 
1997].  Despite  accurately  recording  information,  the  non-linearity  of  hypermedia  allows  the 
reader  to  access  information  in  ways  not  necessarily  anticipated  by  the  author.  In  this  way, 
users  of  analysis  results  can  make  new  discoveries  from  the  same  body  of  data  [Nielson, 

1990],  Likewise,  distribution  of  responsibilities  in  a  large  command  and  control  environment  is 
aided  by  ensuring  that  not  all  uses  of  the  data  must  be  preconceived  though  accurate 
representation  of  constraints  is  essential. 

The  basic  features  of  hypermedia  systems  are  as  follows: 

Node.  A  node  is  an  object  that  represents  a  document  or  some  other  media  element. 

Link.  Links  are  used  to  create  relationships  among  nodes. 

Anchor.  Anchors  connect  nodes  to  the  actual  media  that  make  up  their  content. 

Open  Hypermedia  Systems 

From  1987  to  1991  researchers  noted  that  the  hypermedia  systems  of  the  time  did  not  support 
the  needs  of  collaborative  work  groups  and  that  they  could  not  be  integrated  into  computing 
environments  being  used  in  large  enterprises  [Halasz,  198 7] [Malcolm,  et  al.,  1991], 
Requirements  were  found  for  hypermedia  systems  that  were  not  being  addressed.  These 
included: 
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•  Interoperability  to  access  and  link  information  across  arbitrary  platforms,  applications,  and 
data  sources. 

•  Link  and  node  attributes  to  record  who  made  a  link,  what  the  permissions  are  for  the 
particular  link  or  node  and  other  management  information. 

•  Link  anchors  that  allow  attachment  to  the  exact  data  desired. 

•  Link  types  to  provide  more  information  about  the  meaning  of  a  particular  link  and  what 
functions  the  link  is  intended  to  support. 

•  Public  and  private  links  to  support  collaborative  environments. 

•  T emplates  for  automating  routine  analysi  s  tasks. 

•  Navigational  aids  that  can  act  as  filters  and  supply  powerful  querying  mechanisms. 

•  Configuration  control  so  that  information  of  importance  in  an  analysis  effort  can  be 
developed  and  managed  in  hypertext. 

To  address  these  requirements,  open  hypermedia  systems  evolved.  Open  hypermedia  systems 
have  been  defined  as  those  that  exhibit  the  following  characteristics  [Davis,  et.  al.,  1992]: 

•  A  system  that  does  not  impose  any  markup  on  the  data.  By  marking  up  data  in  order  to 
create  hyperlinks,  the  data  is  changed  making  it  inaccessible  to  systems  that  cannot  handle 
the  markup. 

•  A  system  that  can  be  integrated  with  any  tool  that  runs  under  the  host  operating  system. 
This  can  be  extended  to  mean  a  system  that  can  be  integrated  with  distributed  object 
environments. 

•  A  system  in  which  data  and  processes  may  be  distributed  across  a  network,  and  across 
hardware  platforms. 

•  A  system  in  which  there  is  no  artificial  distinction  between  readers  and  authors.  This  is 
quite  important  for  systems  supporting  analysis. 

•  A  system  to  which  new  functionality  can  be  easily  added. 

Since  analysts  and  decision-makers  are  simultaneously  readers  and  authors  of  node  contents 
and  links,  these  characteristics  are  vital  in  an  information  support  environment.  Likewise,  the 
ability  to  link  objects  without  changing  them  is  critical.  The  information  being  linked  together 
by  the  analysts  may  be  coming  from  other  applications  and  databases  with  which  the 
hypermedia  system  has  been  integrated.  These  applications  will  not  understand  changes 
imposed  on  the  data  in  order  to  support  linking.  The  links  must  be  separated  from  the  content. 
This  is  the  basic  premise  of  open  hypermedia  system  and  has  been  demonstrated  in  many 
research  systems  [Will,  1997], 

The  prototypical  open  hypermedia  system  is  structured  according  to  figure  1  below. 
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Figure  1  Open  Hypermedia  Architecture  [Wiil  and  Numberg,  1999] 


Interaction  Methods 

Several  other  hypermedia  categories  contribute  capabilities  necessary  to  support  analysis 
functions.  Chief  among  these  is  graph-based  hypermedia.  Graph-based  hypermedia  is  based  on 
set  and  graph  theory,  providing  mathematically  defined  filter,  search,  and  navigation  methods. 
This  category  of  hypermedia  gets  its  name.from  the  graphical  depictions  used  for  the  human- 
machine  interface. 

The  idea  of  a  schema  made  of  node  and  link  types  provides  the  basis  for  much  of  this  method’s 
power.  [Lucarella  and  Zanzi,  1996]  The  relationships  among  schema  types  and  between 
schema  entries  and  the  instances  created  in  the  hypermedia  mirror  the  relationships  in  object- 
oriented  design  closely. 

One  result  of  the  typing  found  in  graph-based  hypermedia  systems  is  that  the  resulting 
hypermedia  forms  a  semantic  network.  Semantic  networks  are  used  to  model  concepts  and 
real-world  situations,  making  them  a  natural  tool  for  modelling  a  tactical  situation  or  the 
results  of  intelligence  analysis. 
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HyperObject  Processing  System 

The  design  of  the  HyperObject  Processing  System  (HOPS)  inherits  features  from  both  open 
hypermedia  systems  and  graph-based  hypermedia  systems.  Some  modification  to  the 
established  research  architectures  was  required  in  order  to  support  analysis  of  the  kind 
performed  by  DIA.  These  same  modifications  would  appear  to  be  important  for  related  C4ISR 
systems. 

General  Architecture 

HOPS  follows  the  open  hypermedia  form  with  an  architecture  shown  in  figure  2  below. 


Figure  2  HOPS  Architecture 


However,  in  HOPS,  the  hypermedia  services  are  performed  by  hyperobject  multimedia 
information  systems  (HOMIS)  similar  in  function  to  the  graph-based  MIS  of  the  MORE 
system  [Lucarella  and  Zanzi,  1996],  Each  HOMIS  has  a  schema  and  instance  set,  perspectives 
(‘P’  in  figure  2)  and  filters  can  be  defined,  and  graph-based  navigation  interactions  are 
possible.  (Other  abbreviations  in  the  figure  are  as  follows:  ‘RT’  runtime  application;  ‘ORB’ 
object  request  broker;  ‘PS’  presentation  specification). 

Unique  Hypermedia  Features 

Most  hypermedia  systems  found  in  the  research  literature  work  with  information  spaces 
constrained  by  the  level  of  diversity  and  quantity  of  the  information,  restrictions  on  the 
structure  of  information,  or  by  limited  change  of  the  underlying  data.  Several  aspects  of  HOPS 
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are  unique  among  hypermedia  systems.  The  features  have  been  found  to  be  necessary  to  allow 
HOPS  to  handle  the  dynamic  unbounded  nature  of  military  information  integration. 

Multiple  Anchors. 

The  middle  layer  of  HOPS  holds  the  semantic  network.  Classical  hypermedia  systems  use  a 
node  to  represent  a  piece  of  media  and  anchor  to  a  single  media  element  to  provide  content. 
This  forms  a  semantic  network  describing  the  relationships  among  media  elements  rather  than 
describing  a  tactical  situation.  To  remedy  this,  HOPS  uses  multiple  anchors  per  atomic  node. 
This  allows  the  nodes  to  define  concepts  or  real-world  objects  and  the  links  to  represent 
relationships  among  them  rather  than  relationships  among  the  content  elements. 

Large  Open  Ended  Schema. 

Schemas  imply  an  ability  to  predict  all  of  the  types  of  information  to  be  used  and  the  entire 
range  of  associations  that  will  exist  among  the  elements.  In  some  domains  this  is  possible,  but 
not  in  the  military  information  domain  [Lange,  1999],  An  example  can  be  demonstrated  in 
terms  of  exercise  plans.  During  Tandem  Thrust  97  (TT97),  one  of  the  primary  requirements 
concerned  protecting  the  Great  Barrier  Reef.  Environmental  mitigation  strategies  and 
environmental  reports  are  not  typically  found  in  the  command  and  control  systems  of  our 
armed  forces.  There  will  always  be  unpredicted  situations  in  warfare  and  militaiy  exercises. 
Information  systems  must  be  able  to  adapt  on  the  fly  to  allow  analysts  and  decision  makers  to 
see  and  interpret  information  and  record  and  inform  of  decisions.  The  HOPS  design  allows 
users  to  include  information  not  accounted  for  in  the  schema  through  the  object-oriented 
method  of  deriving  all  nodes  and  links  from  common  ancestors.  This  allows  users  to  bypass 
rules  in  the  schema  and  connect  nodes  and  links  in  ways  not  previously  predicted. 
Administration  tools  then  allow  the  hypermedia  to  be  updated  to  bring  the  new  concepts  into 
the  schema  to  allow  autonomous  tools  to  process  the  information  more  easily. 

Analysis  schemas  and  instance  sets  can  become  quite  large.  The  problems  being  modelled  are 
quite  complex.  The  size  of  the  schema  represents  the  complexity  of  the  model,  while  the  size 
of  the  instance  set  represents  the  quantity  of  information.  Consumers  of  the  analysis  model 
must  be  able  to  filter  both  in  terms  of  complexity  and  in  terms  of  size  of  the  knowledge  base 
that  they  work  with  in  order  to  avoid  being  overwhelmed.  HOPS  allows  this  capability 
through  adaptations  of  the  graph-based  hypermedia  concepts  of  perspective  patterns  and 
filters  [Lange,  1997],  Perspective  patterns  allow  the  user  to  limit  the  kinds  of  information 
being  worked  with,  while  filters  focus  attention  on  information  with  particular  content. 

Link  and  Anchor  Integrity. 

When  important  decisions  are  being  made  based  on  the  information  presented,  error  is  less 
tolerable  than  in  our  daily  workings  with  the  World  Wide  Web.  Anchored  content  must  not 
disappear  unexpectedly.  Likewise  if  content  changes,  then  the  model  must  be  re-evaluated  to 
determine  if  it  is  still  valid.  The  typed  links  of  the  storage  layer  must  also  be  carefully  managed 
to  prevent  dangling  links.  HOPS  accomplishes  these  goals  by  caching  anchored  content,  and 
providing  periodic  checks  using  an  autonomous  change  detection  agent.  Agents  used  for  this 
purpose  can  use  whatever  rules  suit  the  application. 
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Link  Equality. 

Although  hypermedia  relies  on  associations  between  elements  for  its  character,  many  of  the 
interaction  techniques  found  in  research  literature  is  still  focused  on  the  content  (e.g.,  string 
matching  filters  and  searches,  searches  on  images).  Links  are  primarily  used  for  navigation. 

This  may  be  because  in  many  applications,  links  are  addresses,  used  to  point  to  more 
information  or  typed  paths  to  get  to  related  nodes.  Since  the  primary  value  added  by 
intelligence  analysts  and  decisions  makers  is  found  in  the  associations  among  elements,  authors 
and  readers  of  the  products  will  want  the  ability  to  interact  with  typed  links  in  ways  other  than 
simply  using  them  navigation.  They  themselves  convey  critical  information.  HOPS  handles  this 
by  making  links  special  types  of  nodes.  This  allows  all  of  the  mathematics  of  filtering, 
searching,  and  browsing  to  work  on  links.  [Lange,  1997], 

Framework 

HOPS  itself  is  not  command  and  control  system  or  an  analysis  system.  HOPS  is  a  hypermedia 
framework  designed  to  support  analysis  and  provides  some  generic  applications  for  interacting 
with  the  hypermedia.  The  way  that  HOPS  is  intended  to  be  used  is  by  adding  domain  specific 
applications  along  with  an  initial  schema  to  create  an  analysis  system  of  the  type  needed.  This 
has  been  done  in  support  of  DIA’s  mission. 

In  the  Military  Operations  in  Built-up  Areas  project,  HOPS  was  integrated  with  the 
Lightweight  Extensible  Information  Framework  (LEIF)  to  provide  geographic  and  temporal 
views  of  the  hypermedia.  An  intelligence  product  creation  wizard  and  intelligence  specific 
anchors  were  also  used.  Together  with  the  generic  applications  that  exist  within  the 
framework,  users  have  a  variety  of  ways  of  working  with  the  information. 

Prospects  for  Information  Integration 

Hypermedia  systems  hold  promise  for  information  integration.  Any  number  of  decision 
support  tools  can  access  the  semantic  network  formed  of  the  associations  and  nodes.  Since  the 
hypermedia  can  be  made  from  information  elements  from  all  available  systems,  decision¬ 
makers  can  have  access  to  all  of  the  information  they  need.  While  the  semantic  network  is 
serving  higher  level  decision  tools,  the  content  is  left  untouched  and  still  accessible  by  those 
tools  that  interact  directly  with  content  databases. 

Beyond  executing  applications  from  a  single  workstation,  integrated  information  could  provide 
decision-makers  with  a  competitive  advantage.  In  particular  an  integration  method  that  brings 
the  information  into  a  semantic  network  can  allow  meaningful  access  to  humans  and 
autonomous  agents.  The  goal  of  command  and  control  systems  should  be  to  integrate 
information  rather  than  just  the  applications.  An  architecture,  such  as  that  used  for  HOPS, 
centred  on  the  structure  of  information  can  accomplish  this  goal.  Military  plans,  tactical 
situations,  and  their  interaction  can  be  described  using  hypermedia  induced  semantic  networks. 
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